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We report the first draft genome sequence from a member of the genus Agrococcus, isolated from cold thrombolytic microbial- 
ites within Pavilion Lake, British Columbia, Canada. The draft genome assembly for Agrococcus pavilionensis strain RW-1 has a 
size of 2,878,403 bp with a G + C content of 72.56%. 
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Groth et al. (1) first described the genus Agrococcus based on 
two strains of Agrococcus jenensis isolated from soil and the 
surface of sandstone. The genus Agrococcus is classified within the 
family Microbacteriaceae, members of which exhibit the unusual 
feature of having diaminobutyric acid within the cell wall, which 
imparts a distinctive yellow color (1). The genus has eight de- 
scribed species that have been isolated from a wide range of envi- 
ronments, including air (2), a coal mine (3), cheese (4), cold des- 
ert soil (5), forest soil (6), a medieval wall painting (7), dried 
seaweed (8), and the phyllosphere of potato plants (9). The 
genetic richness and diversity of this genus is poorly known, 
with little genetic information available other than for the 16S 
rRNA genes. 

The source of Agrococcus pavilionensis strain RW-1 is the 
oligotrophic (3.3 u,g liter -1 total phosphorus) Pavilion Lake 
(50.86677°N, 121.74191°W), which lies in Marble Canyon near 
Lillooet, British Columbia, Canada. Pavilion Lake harbors a 
diverse array of modern microbialites, which are contemporary 
biogenically derived carbonate structures (10, 11). Pavilion 
Lake microbialites consist mainly of clotted and nonlayered 
thrombolitic structures (10) that occur in the permanently cold 
(4 to 8°C) water deeper than 5 m (11). A. pavilionensis was 
isolated from a cabbage-shaped thrombolite collected at a depth 
of 20 m. DNA was extracted using Qiagen QIAamp followed by 
Qiagen MinElute cleanup columns. The Illumina MiSeq library 
was constructed using the Lucigen NxSeq library prep kit without 
final PCR enrichment. 

Whole-genome shotgun sequencing was completed using Illu- 
mina MiSeq in the 250-bp paired-read format. A partial flow cell 
obtained 2.89 million raw reads with 713,936,519 bp of raw se- 
quence. Paired reads were error corrected and connected using 
AUPaths-LG (version 44837) (12). In the data set, 31mers were 
counted using Jellyfish (version 1.1.10) (13). Reads that contained 
3 lmers with a multiplicity of > 1,250 were partitioned for de novo 
assembly. The partitioned reads were assembled using Celera as- 



sembler 7.0 (14). The high-copy reads assembled as a single contig 
of 1,427 bp in length representing a high-copy plasmid. The re- 
maining reads assembled as 50 contigs summing 2,878,403 bp 
(N 50 length, 133,224; N go length, 31,609; G+C content, 72.56%). 
The 16S rRNA gene sequence was confirmed by Sanger sequenc- 
ing and was found to have 99.99% identity to the 16s rRNA gene 
predicted from the draft genome. 

Annotation was conducted on the RAST server using the Glim- 
mer 3 option (15) and predicted 2,506 protein-coding genes, in- 
cluding 48 noncoding RNA genes and 126 predicted SEED sub- 
system features. The potential to metabolize a wide range of 
carbon compounds is predicted from the genome, including 
D-ribose, fructose, lactate, glycerate, chitin, deoxyribose, and de- 
oxynucleoside catabolism. Genes related to those encoding the 
phosphate (Pho) regulon for high-affinity uptake of phosphate 
and cold shock proteins were also found. 

Further analysis of the genome, including functional and bio- 
chemical measurements, will be used to understand the possible 
roles of A. pavilionensis in the highly diverse microbial community 
contained within the Pavilion Lake microbialites. This is the first 
draft genome for the genus Agrococcus, which will provide a tem- 
plate for many further phylogenetic, comparative genomic, met- 
agenomic, and functional studies of this widely distributed genus. 

Nucleotide sequence accession numbers. This Whole- 
Genome Shotgun project has been deposited at DDBJ/EMBL/ 
GenBank under the accession no. ASHR00000000. The version 
described in this paper is version ASHR01000000. 
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